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ABSTBACT 

At vowel onset following unvoiced consonants in 
/h-cvc/ utterances spoken by two talkers, Fo began high and fell 
about seven percent in the first five centiseconds. At closure of 
7oiced oral obstruents, Fo suddenly dipped about ten percent, 
remained flat, suddenly rose about twenty five percent at opening of 
closure, and, after vowel onset, gradually rose. The high/low feature 
of the vowel, and manner and place of prevocalic consonant 
articulation had progressively less effects on vowel Fo values. The 
final consonant had no apparent effect on Fo values in the vowel. As 
previous synthesis work has suggested, the f^ill or rise of Fo in the 
initial portion of a vowel appears to be a cue to the state of 
voicing of previous consonants. Initial and peak Fo values in the 
vowel also can indicate state of consonant voicing. However, Fo 
contours in hisyllabic words with contrasting stress patterns and 
similar phonemic sequences i^howed that: (1) an initially-falling Fo 
in a vowel may indicate either previous unvoiced consonant or an 
unstressed vowel, and (2) a rising Fo contour may indicate eitber a 
word-initial vowel, a preceding voiced consonant, or a stressed 
vowel. (Author/DD) 
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ABSTRACT 



At vowel onset following unvoiced consonants in /h^CSfC/ utterances 
spoken by two t&lkers, Fq began high (thirty percent higher than in /©/), 
and fell about seven percent in the first five centiseconds. At closure of 
voiced oral obstruents, Fo suddenly dipped about ten percent, remained flat, 
suddenly rose about twenty five percent at opening of closure, and, after 
vowel onset, gradually rose (an average of eight percent in the first ten 
eentiseconds) . The high/low feature of the vowel, and manner and pl?ce of 
prevocallc consonant articiilation had progressively less effects on vowel 
Fo values. The final consonant had no apparent effect on Fq values in the 
vowel. As previous synthesis work has suggested, the fall or rise of Fq in 
the initial portion of a vowel appears to be a cue to the state of voicing 
of previous consonants. Initial and peak Fq values in the vowel also can 
indicate state of consonant voicing. However, Fo contours in bi syllabic 
words vdth contrasting stress patterns and similar phonemic sequences (e.g«, 
permito permit) showed that: (1) an initially-falling Fo in a vowel may 
indicate either previous unvoiced consonant or an unstressed vowel, and 
(2) a rising Fq contour may indicate either a word-initial vowel, a pre- . 
ceding voiced consonant^* or a stressed vowel. 
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INFLUENCES OP PHONEl'IC SEQUENCES AND STRESS 

ON 

FmroMENTAL FREQUENCY CONTOURS OF ISOLATED WORDS 
Wayne A. Lea 

* . 

9 1o Introduction 

Voicing of consoxiants cannot always be detected simply from whether or 
not the fundamental frequency (F^) contour is continuous, since at least 
some devices for tracking fundamental frequency do not yield continuous con- 
tours throughout consonants, ax\d since phonemically unvoiced consonants are 
sometimes voiced, at least for their initial several centi seconds (Snow .and 
Hiaghes, 1969; Lea, 1972, Ch, It has been suggested (Haggard, Ambler and 

Callow, 1970) that pitch is a voicing cue, in that voice fundamental frequen- 
cy (F^) will progressively increase in the beginnings of vowels after voiced 
consonants, while F^ starts high and progressively decreases following un-- 
voiced consonants (cf» also Lehiste, 1970, p© 

The present study showed that the fall or rise of F^ is indicative of 
consonant voicing, but that the consonant and vowel identities, and the 
stress pattern, demand qualifications to the notion of pitch as a voicing 
cm. The rising F^ after voiced consonants was not expected to occtar fol- 
lowing voiced strident fricatives, (Haggard, Ambler, and Callow, 1970, p. 
61 6), Data given in this paper show no such differences between strident 
and nonstrident fricative s. 

2« Fn Contours in Isolated ChaCVC3 Utterances 

A computer program (Snow and Hughes, 1969) was used to extract and plot 
F^ contours from acoustic data for over 320 isolated fha CVCj utterances, each 
spoken by two talkers. All symmetric C^V^C^ sequences (with C-j^C^) were in- 
cluded, where C^ was any English consonant which can occur In both jjosi- 
tions (ioe,,/p, t, k, b, d, g, f , 6 , s, X,tr , v, z, dj , m, l/)* 
Each of the twelve vowels ft,e^*, at^ a, o, a, o^tt, u / occurred 

as the stressed vowel ("Vg") in each syoometric consonantal context. Asym- 
metric CVC combinations were also recorded, for cases where C^ was /j, r, 
\ w, h/ or silence and C^ was one of the six stops/p, t, k, b, d, g/. Other 

asymmetric CVC combinations paited stops in the C^ position with final C^ 
being either /O, silence (no final C^). 
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Contoiar parameters as defined in Fi,gur8 1 were selected to measiire the 

effects, on contours, of the state of voielng and manner and place of 

articulation of and C^* Also considered were the effects of vowel 

identity on such parameters. A judgment was made of "ohe value F^(?1) of 

representative of the unstressed Co3 and the value F^(Cl) representative of 

F values within voiced medial consonantst The initial F within the stressed 
o o 

vowel, ^i)j was determined as that value at voicing onset after phonetically 
imvoiced consonants, or at the position of most rapid energy and i*orraant 
etriicture change after voiced consonants. Peak F^ in the stressed vowel, 
PF^(V2), was usually immediately after voicing onset following unvoiced con- 
sonants^ but nearer the center of the vowel after voiced consonants* The 
differences andA^gF;^ between F^ five or ten centiseconds after vowel 
onset and those at vowel onset, were positive if F^ were rising in the 
Initial part of the vowel, and negative if F^ were falling in the initial 
part of the vowel. 

Central F^ values within obstruents were about 15^ lower than those in 
non-vowel sonorants (Chistovich, 1969). Peak F^ in the vowel was generally 
10 to 15/S higher with surrounding unvoiced consonants than with siorrounding 
voiced consonants. Effects of manner of consonant articulation on F^ val- 
ues are shown in Figure 2^ where peak F^ in Hertz is plotted for each vow- 
el, with consonants pooled into each manner class. Thus, unvoiced frica- 
tives usually gave the highest peak F^ in vowels, especially when spoken 
by talker ASH, while, for some low vowels, talker GWH showed highest F^ in 
the environment of unvoiced stops. .Similar cuives were obtained for 
Initial F^ in the stressed vowels (Lea, 1972), and for this limited data 
from two talkers, 98^ of all stops and fricatives could be correctly cat- 
egorized as unvoiced or voiced by a simple hypothesis that "if initial F^ 
in is greater than a threshold firequency (155 Hz for ASH, 157 for GWH), 
is unvoiced; otherwise it is voiced"© 

Figure 2 also clearly shov/s that high vowels (near the right and left 
extremes of the plots) generally yield higher F^ values than low vo^/ols 
(near the centers of the plots). 

Consonant showed no substantial effects en PF (V2), F (i), or other 
critical F values defined in Figure 1. 
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Effects of phonotio sequences on transition s In values from phoneme 
to phoneme wore also detenained. V/ith unvoiced consonantal contexts, in- 
creased approximately 4-0 Hz (about 30/0 in transition ftom the unstrese^ed Cd3 
to the beginning of the strasaad vowel (and then r^teadily fall throughout 
the remainder of the utterance). For voiced stops and fricatives, F^ fell 
about 12 Hz (about 10$S) in transition from theC©3 into the consonant, then 
rose an average of about 25 Hz (20^) to the Initial value F^(i) in the stres- 
sed vowel, and continued to rise to a peak F^ about 10 or 15 Hz higher than 

F {!)• No substantial dip occurs in medial nonvowel sonorants, for which F 
o o 

generally rises steadily from F^(Vl), to Fq(C1), to F^Ci), and to the peak value 
PF (V2) in the stressed vovel. 

Table I illustrates the reliability of detecting voicing/unvoicing from 
the values of either AF parameter* For example, the simple hypothesis that 
"a rise in F at vowel onset marks a preceding voiced consonant, and a fall 
marks an unvoiced consonant" wotald be correct for 95% of all consonants for 
jiSH, and 9056 for GWH, if "a rise" meant £!k^F>0 and "a fall" that A^F^O. Sep- 
aration between unvoiced and voiced consonants by the size of any AF is more 
reliable for A^qF then for A^F. 

Thus, while the rise or fall of F^ after vowel onset in thoCV^^] ut- 
terances is a good cue to the state of voicing of the prevocalic ccx;aonant, 
it is not perfectly reliableo 

The results showed a slight dependency of AF parameters on the manner 
and place of consonant articulation, biat considerably more data, for other 
talkers, would be needed to substantiate the slight trends indicated (Lea, 1972). 

Contrary to a conjecture set forth Haggard, Ambler, and Callow 
(1970, p« 61 6), voiced strident fricatives did cause substantial rises in 
Fq following onset of the following vowel, just as other voiced consonants 
did. Haggard, Anbler, and Callow had contended that, since the glottis is 
open partially for the stiddent fricativcis, cessation of glottal vibration 
will occur \d.th them, and the F^ jump and following fall of an unvoiced fri- 
cative may then be observed. While it is true that glottal vibration, as 
indicated by computer-determined F^ values, ceased for some portion of the 
closure time for 17 (71%) of the 24. medial strident fricatives [z^^ spoken 
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by the two talkers, it also temporarily ceased for 25 (52?f) of the 4^ non- 
strident medial fricatives Cv3 and cS3» Wliat is more, the average rises 
(that iS| parameters) for strident fricatives (see Figure U) are compar- 
able to those of their nonstrident [v3 and coimterparts, aiid the number 
of falls after strident [z] fricatives was 1 (4^), compared to U (8^) for 
(twice as many) nonstrident fricatives. 

The most dominant effect of vowel identity on the contour parameters 
A^F and/i,|QF was due to the vowel features high/low. Low vowels tended to 
yield somewhat flatter contours at vowel onset, for both unvoiced and voiced 
preceding consonantso Place. of consonant articulation had little effect on 
AF parameters^ Manner of consonant articulation had some effect onAF parameters 
(at least for talicer ASH), with voiced fricatives and sonorants yielding 
slightly larger AF values than voiced stops did. For talker ASH, unvoiced 
fl*icatives were accompanied by more prominent falls for F^ after vowel on- 
set than unvoiced stops were. 

3a Effects of Word-Stress Contrasts on Fr> Contours 

All ChaC^VCgD words exhibit stress on the second vowel, so that studying 
F^ contours in the C^V section of such words demonstrated effects when the 
consonant precedes a stress peak. Studying the VC^ sections indicated the 
lack of any significant effects of a word-final consonant following a stressed 
vowel* To study effects of word stress on F^ contour parameters, sixty bi- 
syllabic words with contrasting stress patterns, (e.g., perm it /pe rmit ; myst ic/ 
mys tique ) were recorded by the same two talkers. A general tendency toward 
higher F^ values in stressed syllables was clearly evident. However, it was 
also evident that the individual talker *s mode of list reading (most spe^- 
cifically, his rising or falling contour) can affect relative F^ values in 
stressed and unstressed syllables. In all 120 words, talker ASH produced 
higher peak F^ in the stressed syllable than in the unstressed syllable of 
the word. No such simple distinction was made ty talker Glffl. This is evi- 
dent in the scatter plots of Figure 3, where peak F^ in the second vowel is 
plotted versus peak F^ in the first vowel. Pairs of peak F^ values cluster 
into two completely i&olatable groups for ASH, co3:rre spending to whether the 
first or the second vowel is stressed. Any straight line between those two 
clusters would separate - V2 from - ^2 stress patterns with accur- 
acy. For talker GWH, the two groups overlapped some. 
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The peak value of fundamental frequency in the first vovel a lone separ- 
ates - and - stress patterns of both talkers very well. A simple 
hypothesis (shown by the dashed lines in Figure 3) that "peak value of F^ in 
vowel V^- is greater than 127 Hz if and only if vowel is stressed" yields 
100^ correct stress classification for ASH, and %% correct for GWH« The 
use of any absolute frequency threshold (like 127 Hz) for distinguishing 
between stress patterns would undoubtedly be futile for talkers with differ^ 
ent physiological structiires, and is certainly not advocated here. However, 
the success of such an hypothesis here does illustrate that peak F^ values 
in initial syllables are more reliably related to stress patterns than 
are values in the utterance-final syllables. We would expect initial 
syllables to be. less affected by confusing influences such as the indivi- 
dual's mode. of list reading (his rising and falling terminal contours) and 
the sloppiness, reduced amplitudes, and falling sub-glottal pressure that 
prevail at the ends of utterances. 

The initial values of F^ in both vowels were also studied. As with peak 
values, a simple hypothesis that "initial F^ value in vowel is greater 
than 120 Hz if and only if vowel is stressed" yielded fairly accurate 
stress classification (98% for ASH and 82% for GWH) . These results are ob- 
vlousiy not as good as these with peak F^ values, and are influenced by the 
identitit^a of previous consonants o 

The above results agree with previous studies showing that F^ tends to 
be higher in stressed syllables. However, the effects of stress on the 
transitions in F^ at consonant-vowel and vowel-consonant boundaries are 
also of interest. Various hypotheses are listed in Table 2 which relate fall- 
ing or rising F^ contours at vowel onset to the stress condition of the 
syllable, the voicing/unvoicing of preceding consonants, and the exceptional 
condition where vowel is wrd-initial (as in aster, a stir ; etc.). 

Published works that suggest that "pitch is a voicing cue" (Haggaid, 
Ambler and Callow, 1970; Chistovich, 1969), and the results with [haC^C] utter- 
ances, would suggest that voiced consonants are followed by rising F^ contoiors, 
lAile falling F^ contours follow unvoiced consonants* The simple "consonantal 
hypotheses," listed as hypotheses 1 and 2 in Table Uy state these claims. 
However, the data show that only about half of the falling (rising3 oontovirs 
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result from unvoiced [voiced^ consonants. The one exception was with rising 
contours in second syllables. These were regularly associated with previous 
voiced consonants (in 100$6 of the syllables of ASH, and 8456 of those of GWH) . 

Other hypotheses that weaken the implications of a rising contour, 
(n\mber 3) or that consider stress effects alone (numbers k and 5) are 
given in TalxLe but hypotheses based on the combination of stress and 
voiced environments yielded the most accurate predictions of stress and phon- 
etic context. As hypothesis 6 "shows, a falling contour may be evidence 
either of a preceding unvoiced consonant, or of an unstressed syllable . 
(with previourj unvoiced or voiced consonant) • A rising F^ conto-ur at 
vowel onset may indicate ("ty hypothesis 8) either ttiat the vowel is word- 
initial or preceded by a voiced consonant, or that the syllable is stressed 
(id.th a prevocalic voiced OE unvoiced consonant). These rialiable hypotheses 
are, however, predlctively quite weak. All that they positively eliminate 
from consideration are (I) the possibility of a stressed vowel with preceding 
voiced consonant yielding a falling contoxxrj and (2) the possibility of an 
unstressed vowel td.th previous unvoiced consonant yielding a rising contour. 

It is evident that falling or rising F^ contours at vowel onset do not sim- 
ply mark either stress or state of voicing, but are functions of a complex 
combination of stress and phonetic context. 
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